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O ■ ABSTRACT 

Oh 1 

The sizes and shapes of voids in a galaxy survey depend not only on the physics of 
2 ■ structure formation, but also on the sampling density of the survey and on the algorithm 

used to define voids. Using an N-body simulation with a rCDM power spectrum, we 
study of the properties of voids in samples with different number densities of galaxies, 
• i-H \ both in redshift space and in real space. When voids are defined as totally empty 

regions of space, their characteristic volume is strongly dependent on sampling density; 
when they are defined as regions whose density is 0.2 times the mean galaxy density, 
the dependence is less strong. We compare two void-finding algorithms, one in which 
voids are nonoverlapping spheres, and one, based on the algorithm of Aikio & Mahonen, 
which does not predefine the shape of a void. Regardless of the algorithm chosen, the 
characteristic void size is larger in redshift space than in real space, and is larger for low 
sampling densities than for high sampling densities. We define an elongation statistic Q 
which measures the tendency of voids to be stretched or squashed along the line of sight. 
Using this statistic, we find that at sufficiently high sampling densities (comparable to 
the number density of L > L* galaxies), large voids tend to be slightly elongated along 
the line of sight in redshift space. 

Subject headings: cosmology: theory - galaxies: distances and redshifts - large-scale 
structure of universe 
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1. Introduction 

When the positions of galaxies are mapped in redshift space, the nature of the large-scale 
structure seen in the maps is "frothy" or "bubbly". Voids - regions in which there are few or 
no galaxies - fill much of the map. The galaxies exist mainly in thin filaments or sheets that lie 
between the voids. Comparison of the observed pattern of voids with that predicted by a given 
model of structure formation is potentially a very powerful test for acceptance or rejection of that 
model. Unfortunately, the differences between models are often subtle. Many models for structure 
formation predict a bubbly galaxy distribution. In the gravitational instability scenario, regions 
that are originally underdense expand faster than the Hubble flow (Fillmore & Goldreich 1984; 
Bertschinger 1985). As underdense regions expand, they become more nearly spherical (Fujimoto 
1983; Icke 1984; Blaes, Goldreich, & Villumsen 1990). Whether voids in the actual universe are 
viewed as isolated spherical structures or as a space-filling foam depends on the density level at 
which voids are defined (Gott, Melott, & Dickinson 1986). Van de Weygaert & van Kampen 
(1993) did extensive studies of voids in constrained Gaussian fields, finding that voids become 
nearly spherical in their very underdense inner regions, while their boundary regions remain more 
irregular. 

If structure grows via gravitational instability, then the size and shape of voids depends on 
the initial power spectrum P(k) for the density fluctuations and on the density parameter Q. In 
an open universe, void evolution stops when Q ~ 0.5; thereafter, the void network simply expands 
along with the Hubble flow (Regos & Geller 1991). At a given epoch, the mean void radius is 
proportional to the nonlinearity scale fc n i (Kauffmann & Melott 1992). The full distribution of void 
sizes, however, depends on the shape of the spectrum P(k) (Melott 1987; Ryden & Melott 1996). 

If we know with absolute accuracy the position of galaxies in real space, we could use the 
spectrum of void sizes to place constraints on the initial power spectrum P(k). However, measuring 
the distances to galaxies is difficult; it is much easier to measure their redshifts. Consequently, 
practical studies of the properties of voids must take place in redshift space rather than real space. 
If all galaxies smoothly followed the Hubble flow, with no peculiar velocities, and if the Hubble 
constant Hq were truly constant with time, then the mapping between real space and redshift space 
would be linear. Generally, though, the Hubble constant changes with time, and structure that is 
isotropic in real space becomes distorted along the line of sight in redshift space, with the amount 
of distortion increasing with redshift z (Alcock Sz Paczyhski 1979). Potentially, distortions in the 
shapes of voids in redshift space can provide an estimate of the deceleration parameter qo (Ryden 
1995). However, the distortions that result from cosmological effects become large only when z ~ 1. 
In the nearby universe, where z < 1, the dominant contribution to distortions from redshift space 
comes from the peculiar velocity of galaxies. Ryden & Melott (1996) demonstrated, for instance, 
that in two-dimensional simulations with P(k) oc k n , the characteristic void size is larger in redshift 
space than in real space. Redshift space distortions can also, in principle, be harnessed to measure 
Cl m , the density in nonrelativistic matter (Melott et al. 1998). One goal of this paper is to extend 
the analysis of Ryden & Melott (1996) to three-dimensional simulations with more realistic CDM 
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power spectra. 

Analyzing the properties of voids first requires a definition of what a void is. Some of the 
statistics used to describe voids define a void as a region totally devoid of galaxies. The void 
probability function (VPF) is one such statistic; the VPF Po(V) is the probability that a randomly 
positioned sphere of volume V contains no galaxies (White 1979). However, the random dilution of 
a point process (selecting only a fraction of all the galaxies in the universe, for instance), although 
it leaves the correlation function unchanged, strongly affects VPF (Sheth 1996). Thus, from a 
practical viewpoint, it makes more sense to define a void which is underdense with respect to the 
average number density of galaxies. Little & Weinberg (1994), for instance, use the underdense 
probability function (UPF). The UPF Pf(V) can be defined as the probability that a randomly 
positioned sphere of volume V contains a number density of galaxies less than or equal to / times 
the average number density of galaxies in the entire survey. We will follow Little & Weinberg (1994) 
in setting a void threshold of / = 0.2, matching the density contrast of the largest voids in the 
CfA redshift survey (Vogeley, Geller, & Huchra 1991; Vogeley et al. 1994). One advantage of the 
UPF over the VPF is that it is relatively insensitive to the sparseness with which the galaxies are 
sampled. In this paper, we will explicitly compare the UPF and VPF for different galaxy sampling 
densities. 

In addition to statistical measurements such as the VPF and UPF, algorithms exist that 
identify individual voids within a sample (Kauffmann & Fairall 1991; Kauffmann & Melott 1992; 
Ryden 1995; Ryden & Melott 1996; El- Ad & Piran 1997; Aikio & Mahonen 1998). In addition to 
providing a spectrum of void sizes, these void-detection algorithms also enable us to specify the 
location and shape of individual voids. In this paper, we first use an algorithm (an extension of 
the two-dimensional algorithm of Ryden [1995]) which defines voids as non-overlapping underdense 
spheres. Although this algorithm is conceptually simple, it has the disadvantage of forcing voids 
to be spherical. Thus, we also employ a more sophisticated algorithm, based on that of Aikio & 
Mahonen (1998; AM), which does not constrain voids to be any particular shape. The original AM 
algorithm defines voids as empty regions; our modification of the original defines voids as underdense 
regions. For the voids found by the modified AM algorithm, we introduce an elongation statistic 
Q which measures whether the voids are stretched or squashed along the line of sight from the 
origin. Just as with the UPF and VPF, we also apply the void-detection algorithms to different 
sampling densities of the same survey. Our studies of void properties in numerical simulations 
will permit a more effective interpretation and understanding of voids in future three-dimensional 
redshift surveys. 

In section 2, we describe the N-body simulation analyzed in this paper. In section 3, we 
examine the statistical properties of voids, as given by the VPF and UPF. In section 4, we apply 
void-detection algorithms to the simulations. Finally, in section 5, we analyze the effects of peculiar 
velocities and galaxy sampling density on void properties, and discuss implications for future work. 
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2. The Numerical Simulation 

The simulation used for testing in this paper was done using a particle-mesh (PM) N-body 
simulation. The PM method is quite fast, and with a mean particle density of one per simulation 
cell, represents the maximum resolution that can be achieved without introducing two-body scat- 
tering that decouples the result from its initial conditions on small scales (Kuhlman, Melott, & 
Shandarin 1996; Splinter et al. 1998). The simulation used here had 256 3 particles on a 256 3 mesh. 
Initial conditions were generated by Fast Fourier Transform with random phase perturbations. 
Since behavior at high density peaks is not of interest to us in our study of voids, the simulations 
were begun with an RMS density of 0.25 at the resolution limit. The simulations were evolved until 
the RMS overdensity inside a randomly located sphere of radius 8/i -1 Mpc was ag = 1.05. 

A matter-dominated Friedman-Robertson- Walker background density was assumed, with the 
cosmological constant A set equal to zero. Since nonlinear modes are filtered out, and the dynamical 
effect of nonzero is well-understood in perturbation theory (Lahav et al. 1991), we did not use 
nonzero values of A. The value of Q m , the dimensionless matter density, was taken to be O m = 1. 
The box size was take to be 1536 Mpc and the Hubble constant h = i^o/lOOkms -1 Mpc -1 = 0.75. 
Thus, the box size in redshift space corresponds to 1.152 x 10 5 kms _1 = 0.384c. Formally, in the 
simulation h only sets an overall timescale; since both the expansion rate and the particle velocities 
scale with h, the redshift space appearance does not change with h, only its overall scale. 

The initial power spectrum we assume is a cold dark matter (CDM) spectrum, in which the 
parameter T determines the shape of the spectrum. Smaller values of T are favored today, because 
they push the turnover in the slope of the power spectrum to large scales, in better agreement with 
observations. To test our void-finding algorithms, we use T = 0.25. Normally, T is taken to be Qoh, 
as the turnover scale is set by the horizon at the end of the radiation-dominated era. However, 
we break this assumed coupling and take T as a free parameter descriptive of the spectral shape. 
Thus, our model corresponds to what is sometimes called rCDM. 

The number density of mass points in the simulation, n = 4.63 x 10~ 3 Mpc -3 , corresponds to 
the number density of galaxies with L > 0.42L*, assuming that the luminosity function of galaxies is 
a Schechter function with slope a = —1.07 and normalization <!>* = 6.75 x 10~ 4 Mpc~ 3 (Efstathiou, 
Ellis, & Peterson 1988). One purpose of this paper is to investigate the dependence of void properties 
on the sampling density of a galaxy survey. Thus, from the initial numerical simulation, we have 
created three different samples, corresponding to volume-limited surveys with different luminosity 
cutoffs. To create our densest sample, we randomly selected a fraction X = 0.82 of the initial 
mass points, in order to match the number density of galaxies with L > L*/2. For the next densest 
sample, we selected a fraction X = 0.31 of the mass points, to match the number density of galaxies 
with L > L*. Finally, for our least dense sample, we selected a fraction X = 0.067 of the mass 
points, to match the density of galaxies with L > 2L*. Note that by randomly selecting galaxies 
in this way, we are assuming that galaxies are unbiased with respect to the mass distribution, and 
that luminosity segregation does not exist. We wish to study only the effects of different sampling 
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densities, and not the more subtle effects of bias. 

For each sampling density, we create two mock galaxy surveys, one without peculiar velocities 
(which we will call the "real space" survey) and one with peculiar velocities (the "redshift space" 
survey). The mock surveys are spheres with a radius of 480 Mpc, or z = 0.12 in redshift units. Note 
that in a flat Q m = 1 universe, a galaxy with L = L*/2 located at z = 0.12 will have an apparent 
magnitude of tub ~ 19, assuming = —19.7 + 51og/i (Efstathiou, Ellis, & Peterson 1988). For 
a typical galaxy color of r' — B —1, this roughly corresponds to the flux limit r' = 17.7 expected 
for the galaxy redshift sample of the Sloan Digital Sky Survey (Weinberg 2000). A slice through 
the L > "real space" survey is shown in Figure la; a slice through the L > L* "redshift space" 
survey is shown in Figure lb. 

3. Statistics of Voids 

The void probability function, or VPF (White 1979) has been a widely used statistic for 
measuring the characteristic size of voids. The VPF Po(V) is defined as the probability that a 
randomly located sphere of volume V contains no galaxies. The VPF has been applied to numerical 
simulations (Fry et al. 1989; Einasto et al. 1991; Weinberg & Cole 1992; Little & Weinberg 1994; 
Vogeley et al. 1994; Ghigna et al. 1994, 1996; Colombi, Bouchet, k, Hernquist 1996; Kauffmann, 
Nusser, k, Steinmetz 1997) and to redshift surveys (Fry et al. 1989; Einasto et al. 1991; Vogeley, 
Geller, & Huchra 1991; Vogeley et al. 1994; Ghigna et al. 1996). The VPF for the simulation studied 
in this paper is given in the left-hand panels of Figure 2. The volumes plotted are in redshift units; to 
convert to physical units, multiply by (c/i^o) 3 = 6.4 x 10 10 Mpc 3 . In each panel, the solid line shows 
the VPF in real space, and the dotted line shows the VPF in redshift space. The effect of peculiar 
velocity distortions is to increase the void probability at a given volume V. A comparison of the 
VPF for the different sampling densities, however, vividly illustrates the very strong dependence 
of the VPF on the mean interparticle spacing. To illustrate, let's define a characteristic void 
volume Vvpf as the volume for which Po(Wpf) = 0.01. For a Poisson distribution of points, 
Po(V) = ex P( — vV), where v is the mean number density, and thus VVpf = 4.61/za For our 
L > L*/2 sample, V VPF = 4.0 x 10~ 8 = 9.6/ v; for our L > sample, V VPF = 1.14 x 10~ 7 = 10.3/z/; 
and for our sparse L > 2L* sample, Vvpf = 4.3 x 10 -7 = 8.3/za Thus, the voids in the simulation 
are larger than those in a Poisson distribution of equal v, but it is still approximately true that the 
characteristic void size (defining voids as totally empty volumes) is proportional to v~ x . 

The dependence of characteristic void size on the sample density v is reduced if we use the 
underdense probability function (UPF) to measure the statistics of voids. The UPF Pf(V) is 
defined as the probability that a randomly located sphere of volume V contains a number density 
of galaxies that is less than fi/, where < / < 1, and v is the mean number density of galaxies in the 
sample. (In a flux-limited survey, we can generalize the UPF so that P/(V) is the probability that 
a sphere of volume V located at a distance r from the origin contains a number density of galaxies 
that is less than fv(r), where v(r) is the mean number density of detected galaxies at r.) Following 
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previous work (Vogeley, Geller, & Huchra 1991; Weinberg & Cole 1992; Ryden & Melott 1996), we 
will set our density threshold at / = 0.2. The UPF for the simulation studied in this paper is given 
in the right-hand column of Figure 2; in each panel, the solid line shows the UPF in real space and 
the dotted line shows the UPF in redshift space. Again, the effect of peculiar velocity distortions 
is to increase the probability of finding a void of volume V. For volumes V < l/(fu), the UPF 
and the VPF are identical, since such a small volume can only fall below the density threshold if 
it contains no galaxies. In general, a sphere of volume V will be underdense if it contains at most 
M galaxies, where M = mt(fvV). When M is small, the UPF shows discreteness effects, visible 
as the sawtooth pattern in Figure 2; the UPF jumps upward whenever V is an integral multiple of 

!/(/")■ 

The characteristic void size Vupf is defined as the volume for which Po.2(^upf) = 0.01. (The 
choice of Pq = 0.01 as the defining probability is somewhat arbitrary; however, because of the 
rapid decline of Po at large volumes, the exact value of Po is not crucial.) For our dense L > L*/2 
sample, Vupf = 2.7 x 10~ 7 = 64/z/; for the L > sample, Vupf = 4.2 x 10~ 7 = 38/v; for the 
sparse L > 2L* sample, Vupf = 9.2 x 10~ 7 = 17.7 jv. The inclusion of peculiar velocity distortions 
increases Vupf by a factor which ranges from 2.5 for the densest sample to 1.4 for the sparsest 
sample. Although the dependence of the underdense void size Vupf on v is less strong than that 
of Vvpf, it is not true that Vupf is independent of v for plausible sampling densities. Thus, in a 
flux-limited survey, where the measured v decreases with distance from the origin, the characteristic 
void size Vupf will increase with distance. 



4. Void-detection Algorithms 

The UPF gives a statistical measure of the number and size of voids in a given galaxy dis- 
tribution. Frequently, however, it is useful to identify individual voids instead of simply giving a 
statistical description. Many different algorithms have been used to detect and identify individual 
voids (Kauffmann & Fairall 1991; Kauffmann & Melott 1992; Ryden 1995; Ryden & Melott 1996; 
El- Ad & Piran 1997; Aikio & Mahdnen 1998). In this paper, we will investigate two void-detection 
algorithms that are distinguished by their ease of use and clarity of conception. The first algo- 
rithm, based on that of Ryden (1995), identifies voids as nonoverlapping spheres; we will call this 
the "sphere algorithm". The second algorithm, based on that of Aikio and Mahohen (1998; AM) 
permits voids to be nonspherical; we will call this the "AM algorithm" . 

Both void-detection algorithms start by defining a continuous scalar field Df(x) within the 
galaxy survey. At any location x, Df is defined as the radius of the largest sphere centered on x 
within which the average galaxy density is equal to jv. To implement the "sphere algorithm", first 
locate the global maximum of Df within the survey; call the location of this maximum x\. This 
is the center of the largest spherical void in the sample, which has a radius Df{x\). To find the 
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second largest void, find the point X2 for which Df is maximized, subject to the constraint that 



In other words, the second-largest void is the largest underdense sphere that doesn't overlap the 
largest void. Additional voids are found by an iterative process. The ./V th largest void is located at 



for i = 1,2,...,JV-1. 

In practice, we compute the values of Df on a cartesian grid superimposed on the galaxy 
distribution. For the simulation used in this paper, the grid spacing we used was Ax = 0.0012 in 
redshift units (Ax = 4.8 Mpc in physical units). This is 0.8 times the resolution of the original 
numerical simulation. Using a grid very much finer than that of the original simulation is pointless, 
since there is no information on such small scales. The computed value of Df for each grid point 
was forbidden to be larger than the distance from the grid point to the sample boundary at z=0.12. 
We then located the spherical voids, using the algorithm outlined above, subject to the additional 
constraint that the void centers lie on grid points. Since the discreteness of the superimposed grid 
creates errors of order Ax in the location of void centers, we halt the void-detection algorithm when 



To emphasize the difference between the properties of totally empty voids and underdense 
voids, we implemented the sphere algorithm twice, once with / = 0, and once with / = 0.2. To 
illustrate the voids found with the sphere algorithm, Figure 3 shows a slice through the underdense 
spheres found in the L > sample, setting / = 0.2. Figure 3a shows the spherical voids found in 
real space (without peculiar velocity distortions) and Figure 3b shows the spherical voids found in 
redshift space (with peculiar velocity distortions). 

To show the spectrum of void sizes found with the sphere algorithm, Figure 4 plots Ff(V), the 
fraction of the total volume of the sample found in spherical voids with volume > V. The right 
column of Figure 4 displays Fq.2, the fraction of the total volume found in voids with underdensity 
/ = 0.2. For comparison, the left column of Figure 4 gives Fq, the fraction of the total volume 
found in totally empty voids. The solid line in each panel gives the distribution in real space, and 
the dotted line gives the distribution in redshift space. The left column of Figure 4 demonstrates, as 
in the case of the VPF, that the characteristic size of empty voids depends strongly on the density 
v of the sample. Because of this undesired feature, we will only examine in detail the properties of 
underdense voids. 

A characteristic void size can be found, following the practice of Kauffmann & Melott (1992), 
by computing the volume-weighted mean void size, 




the position xV for which Df is maximized, subject to the constraint that 

Df(xN) + Df(xi) > \xn - Xi\ , 



(2) 



Df = 2Ax. 




(3) 



i 



i 
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where Vi is the volume of the i largest void, and the total number of voids is Ny. Alternatively, 
we can define a characteristic void size Vx which is the void volume such that X percent of the 
total volume in the sample is contained in voids of size Vx or bigger. The characteristic void sizes 
V and V20 are given in Table 1 for underdense voids in samples of different number density, with 
and without peculiar velocities. 

The characteristic size of spherical voids (whether defined as V or V20) increases in going from 
real space to redshift space. The more densely sampled the survey, the greater the increase in void 
size. For the L > 2L* sample, V increases by 43% in going from real space to redshift space, and 
V20 increases by 40%. For the L > L*/2 sample, V increases by 88% in going from real space to 
redshift space, and V20 by 155%. 

Although the sphere algorithm gives a rough estimate of the spectrum of void sizes, the volumes 
of the voids found by this algorithm will generally be underestimates of the "true" void size. That is, 
if a sample contains an empty region surrounded by a continuous, well-defined, extremely overdense 
wall, the void found by the sphere algorithm will be the largest sphere that can be inscribed within 
the wall. The remaining empty space within the void wall will then be iteratively filled with smaller 
and smaller spheres. A more flexible algorithm - one which doesn't impose the artificial constraint 
that voids are spherical - should give a more accurate measure of void size. 

In addition to underestimating the size of voids, the sphere algorithm gives no hint of the true 
void shape. In two dimensions, Ryden (1995) and Ryden & Melott (1996) estimated the shapes of 
voids by fitting ellipses to the underdense region. Each void was then characterized by an axis ratio q 
and a position angle <f> m addition to its area a and position (x, y). Extending this algorithm to three 
dimensions by fitting ellipsoids to the underdense regions becomes a computationally daunting task. 
Each ellipsoidal void must be characterized by two axis ratios and three Euler angles in addition 
to its volume V and position (x, y, z). The introduction of additional parameters makes the search 
through parameter space far more time-consuming. Thus, instead of approximating the shape of 
voids by fitting ellipsoids to them, we adopted the more flexible scheme of Aikio & Mahonen (1998; 
AM). 

To implement the AM algorithm, we start with the field Df as defined on the cartesian grid 
that we have superimposed on the galaxy survey. If the grid spacing is Ax, then each grid point can 
be thought of as being at the center of a cubical "elementary cell" of volume (Ax) 3 . We identify 
the local maxima of Df on the grid as being those points which have values of Df greater than that 
of their 26 closest neighbors (the 6 points at a distance Ax, the 12 points at a distance y/2Ax, and 
the 8 points at a distance y/3Ax). We label the local maxima we find as Mi, M2, . . ., Mjy, where 
N is the total number of maxima located. The AM algorithm assigns every elementary cell to a 
"subvoid" associated with some maximum Mj. To discover which subvoid a particular elementary 
cell belongs to, a "climbing algorithm" is used. For a elementary cell X, we compute the gradient in 
Df to each of the neighboring cells. We then "climb" to elementary cell X' for which the gradient 
has the largest (positive) value. The climbing continues from cell to cell until a local maximum 
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Mj is reached. The cell X (and every other cell along the climbing route) is then assigned to the 
subvoid of maximum Mj. In this way, every elementary cell is assigned to a subvoid, and each 
maximum Mj has an associated subvoid which consists of at least one elementary cell. Once every 
elementary cell is assigned to a subvoid, the subvoids are joined together into larger voids. The 
subvoids associated with maxima Mj and Mj are members of the same void if the distance between 
Mi and Mj is less than both Df(Mi) and Df(Mj). Using this criterion, all the subvoids are grouped 
into voids; some voids contain a single subvoid, while others contain many subvoids linked together 
in a friends-of-friends percolation. The AM algorithm and its implementation is described in more 
detail in the original paper by Aikio & Mahonen (1998); our modification is to use the underdensity 
field Df, where AM restricted themselves to the case / = 0. 

Figure 5a shows a slice through the underdense voids in the L > sample, without the 
inclusion of peculiar velocity distortions. The AM algorithm was used with a density threshold 
/ = 0.2. In each panel of Figure 5, an elementary cell is colored white if it belongs to the same 
void as its 26 nearest neighboring cells; it is colored black if one or more of its neighbors belongs 
to a different void. Figure 5b shows a slice through the underdense voids in the L > L* sample, 
this time with the effects of peculiar velocities included. 

The volume V of an individual void found by the AM algorithm can be found by simply adding 
together the volume (Ax) 3 of the elementary cells which it contains. The full spectrum of void 
sizes in a particular galaxy sample is given by Ff(V), the fraction of the total volume of the sample 
found in voids with volume > V. The left column of Figure 6 shows Fq, the distribution of void 
volumes using the original algorithm of AM, in which / = 0. The right column of Figure 6 displays 
F0.2, using our usual underdense criterion, / = 0.2. Some properties of the AM voids are the similar 
to those of the spherical voids found earlier. For instance, the left column of Figure 6 shows that 
the characteristic size of empty AM voids (just like those of spherical voids) is strongly dependent 
on the density v of the sample. Also, peculiar velocities increase the characteristic size of AM voids 
as well as of spherical voids. Using the AM algorithm with / = 0.2, the volume-weighted mean 
void size V and the characteristic size V20 are given in Table 1. For the sparsely sampled survey 
(L > 2L*), V increases by 58% in going from real space to redshift space, and V20 increases by 
66%. For the densely sampled survey (L > L*/2), V increases by 79% in going from real space 
to redshift space, and V20 increases by 82%. In addition, AM voids as well as spherical voids are 
larger at lower sampling densities. Given a density threshold of / = 0.2, the characteristic void 
size (either V or V20) for the sparsely sampled survey (L > 2L 1f ) is 2.5 times that of the densely 
sampled survey (L > L*/2). 

One important difference between the voids found by the AM algorithm and those found by 
the sphere algorithm is that the AM voids are larger. At all sampling densities, it is found that 
V( AM)/V( sphere) ~ 2.7. The ratio V 2 q{ AM)/ V 2 q( sphere) ranges from ~ 6 for the L > 2L* survey 
through ~ 8 for the L > L* survey to ~ 14 for the L > L*/2. Thus, the answer to the seemingly 
innocuous question, "How large is a typical void?" depends not only on the sampling density of 
the galaxy survey and on whether the survey is done in real space or in redshift space, but also on 
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the void-finding algorithm used and on the definition adopted for the typical void size. 

Another important difference between the voids found by the AM algorithm and those found 
by the sphere algorithm is that the AM voids are not compelled to be spherical. Hence the shapes 
of the AM voids can be used as a measure of the shape of voids in the galaxy distribution. It is of 
particular interest to discover whether voids are distorted along the line of sight from the observer 
at the origin. At relatively small redshifts (z < 1), the dominant source of distortion in redshift 
maps is the peculiar velocities of galaxies. In examining two-dimensional simulations with power 
spectra P a P, Ryden & Melott (1996) found only a mild tendency for voids to be distorted or 
compressed along the line of sight. With an n = 2 power spectrum, the largest voids were slightly 
elongated along the line of sight; with an n = spectrum, voids were slightly compressed along the 
line of sight. 

To find whether the AM voids are preferentially elongated or compressed along the line of 
sight, we begin by computing the moments of the voids. If a void contains N elementary cells, with 
the center of the i th cell at (xi,yi, zi), the moments of the void can be computed as 



The "center of mass" of the void, weighting all elementary cells equally, is at (x,y,z). For a given 
void, we can create a new coordinate system, with its origin at (x,y,z), with its x' axis passing 
through the location of the observer, and with its y' and z' axis perpendicular to the x' axis. If we 
know the coordinates (xi,yi,Zi) of a mass element in the old coordinate system (centered on the 
observer), we can compute the coordinates {x^y'^ z[) in the new coordinate system (centered on 
the void center). 

A dimensionless measure of a void's elongation or compression along the line of sight is 



The quantity Q has some useful properties. Its denominator is independent of the orientation of 
the coordinate system; it's simply the mean square distance of all the elementary cells from the 
void center. For a void of arbitrary shape, the mean value of Q, averaged over all viewing angles, 
is < Q >= 1. Thus, for a population of voids oriented randomly with respect to the observer, we 
expect the average value of Q to be 1. A value Q > 1 indicates that a void is elongated along the 
line of sight; a value Q < 1 indicates that a void is compressed along the line of sight. For any void, 
in any orientation, < Q < 3. As an example, consider a triaxial ellipsoid with principal axes of 
length a > b > c. The maximum value of Q for this ellipsoid occurs when the long axis is aligned 
with the line of sight from the observer to the ellipsoid's center. In this case, 




(4) 



i=i 



Q = 



3(x') 2 



(5) 



(x'f + {y'f + (z'f 



Q 



3a 2 



> 1. 



(6) 



a 



2 + b 2 + c 2 - 
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The minimum value of Q occurs when the short axis is aligned with the line of sight. In this case, 

3c 2 

Qmin = o j 7o~. 9 — 1- v) 

a z + b z + c 2 

For the voids found by the AM algorithm, the denominator of Q can be written as 

^2 



(x'f + (y<) 2 + (z>) 2 = D 2 -D\ (8) 
where 



and 



The numerator can be written as 



D 2 =x 2 +y 2 + z 2 (9) 



D 2 = x 2 + y 2 + z 2 . (10) 



3(x') 2 = —D + x x 2 + y y 2 + z z 2 + 2x y xy + 2y z yz + 2z x zx . (11) 
D 1 J 

To compute the distribution of Q as a function of void volume V, we first identify voids using the 
AM algorithm with a density threshold / = 0.2; to eliminate distortions caused by the artificial 
boundary conditions at z max = 0.12, we consider only those voids at redshifts z < 0.11. The mean, 
fiQ, and the standard deviation, oq, of Q for the voids found in this way are plotted in Figure 7, as 
a function of void volume V. The values of fiQ and gq are computed in bins containing 400 voids 
apiece. Thus, the expected error in the mean value of Q for each bin is = <tq/V 400 = 0.05cjq. 
The value of fiQ indicates whether voids are preferentially oriented with respect to the line of sight; 
the value of oq is a measure of the intrinsic asphericity of the voids. 

In the left column of Figure 7, the shape of voids is measured in real space, where there should 
be no preferential distortions along the line of sight. Indeed, hq is not significantly different from 
unity for voids of all sizes and at all sampling densities: in all cases, \[iq — 1| < 2.50^. There is, 
however, a significant trend in the standard deviation of Q. As voids get larger, cjq gets smaller, 
decreasing from ctq ps 0.34 at V = 5 x 10~ 8 to oq « 0.27 at V > 10~ 6 . This is a consequence of 
the fact that large voids are more nearly spherical than small voids. 

In the right column of Figure 7, the shape of voids is measured in redshift space, where the 
distortions due to peculiar velocities may cause systematic distortions along the line of sight. At 
sufficiently high sampling density (the L > and L > L*/2 samples), large voids are significantly 
elongated along the line of sight (fiQ > 1). In the L > L* sample, fiQ > 1 for volumes V > 
4 x 10~ 7 ~ 0.2F. In the L > L*/2 sample, hq > I for volumes V > 1.3 x 10~ 7 ~ O.lF; the greatest 
deviation of \iq from unity is at V = 1.25 x 10~ 6 ~ V, where hq — 1 = 6cr^. We conclude that the 
distortions along the line of sight caused by peculiar velocities are measurable by the AM algorithm 
only when the sampling density is sufficiently high (corresponding to a limiting galaxy luminosity 
of ~ L* or fainter). For our initial CDM power spectrum, voids with V > V show a significant 
tendency to be elongated along the line of sight by peculiar velocities. 



-12- 



5. Conclusion 

In examining the large scale structure of the universe, studies of overdense regions (clusters 
and superclusters) are usefully complemented by studies of underdense regions, or voids. The 
properties of voids, such as their volumes and shapes, depend on how voids are defined. Since the 
literature contains many different void definitions and void-finding algorithms, direct comparison 
of void properties found in different studies is a risky business. 

The VPF, Pq(V), depends strongly on the number density v of galaxies in the survey. Although 
the UPF, Pq.2(V), depends less strongly on sampling density than the VPF, it is still true that the 
characteristic void size Vjjpf depends on the sampling density v, with a higher density yielding a 
smaller characteristic void size. Thus, in a flux-limited survey, where the mean density of detected 
galaxies, v(z), decreases with redshift, using a threshold density fv(z) will produce a characteristic 
void size that increases with increasing redshift. This will be an artifact of the decreasing sampling 
density at large redshift, and does not reflect a change in the underlying large scale structure. 
In determining how the properties of voids depend on redshift, it is more prudent to extract a 
volume-limited sample from the data before applying a void statistic like the UPF or a void-finding 
algorithm. 

Of the void-finding algorithms outlined in this paper, the "sphere" algorithm has the virtue 
of (relative) simplicity. However, its restriction that all voids must be spherical leads to an un- 
derestimate of void size and does not permit us to measure the distortions of voids caused by 
peculiar velocities. Adapting the algorithm of Aikio & Mahonen (1998), we were able to determine 
more accurately the sizes of underdense voids. As in two-dimensional simulations (Ryden & Melott 
1996), the effect of peculiar velocities is to increase the characteristic void size. The AM algorithm 
also permits us to measure the elongation of voids along the line of sight. In real space, it is found 
that large voids are intrinsically more nearly spherical than smaller voids. This can be regarded as 
a manifestation of the tendency for large voids within a bubbly structure to expand and become 
more nearly spherical at the expense of their smaller neighbors (Regos Sz Geller 1991). In redshift 
space, large voids are seen, for the CDM spectrum used in our simulations, to have a slight but 
statistically significant tendency to be elongated along the line of sight. Note, however, that the 
void distortions can only be detected at a sufficiently high sampling density. Future redshift surveys 
such as that provided by the Sloan Digital Sky Survey will provide sufficiently high galaxy densities 
and a large enough number of voids to accurately measure the peculiar velocity distortion of voids 
in the real universe. When deeper redshift surveys are available, the (relatively small) peculiar 
velocity distortions can be subtracted out to reveal the cosmological distortions resulting from the 
deceleration of the Hubble expansion. 

This work was supported by a grant from the Ohio Supercomputer Center. ALM wishes to 
acknowledge support from the National Science Foundation under grant number AST-0070702, 
the University of Kansas General Research Fund and the National Center for Supercomputing 
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Table 1: CHARACTERISTIC VOID SIZES 



Spherical voids 



AM voids 



Sample 



V V 20 



V 



L> 



L > 2L* 



L > L*/2 



(real) 

(redshift) 

(real) 

(redshift) 

(real) 

(redshift) 



0.82E-6 0.67E-6 

1.17E-6 0.93E-6 

0.42E-6 0.21E-6 

0.71E-6 0.39E-6 

0.34E-6 0.085E-6 

0.64E-6 0.22E-6 



2.26E-6 
3.57E-6 
1.12E-6 
1.95E-6 
0.91E-6 
1.63E-6 



3.49E-6 
5.80E-6 
1.75E-6 
3.08E-6 
1.39E-6 
2.53E-6 
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Fig. 1. — (a) A slice of thickness 12/i _1 Mpc through the N-body simulation analyzed in this paper. 
The number density of mass points is chosen to be comparable to the observed number density of 
galaxies with L > L*. (b) A slice of the same thickness through the same sample, but with peculiar 
velocities added. In each slice, only galaxies with z < 0.12 are included. 
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Fig. 2. — The void probability function (left column) and underdense probability function (right 
column) for the CDM simulation. In each column, the top panel represents the most sparsely 
sampled subset of points (corresponding to L > 2L*), and the bottom panel represents the most 
densely sampled subset of points (corresponding to L > L*/2). The volume is given in dimensionless 
redshift units. 
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Fig. 3. — (a) A slice through the spherical underdense voids found in the L > sample; the 
density threshold chosen was / = 0.2. The orientation of the slice is the same as that shown in 
Figure la. (b) A slice through the spherical underdense voids in the same sample, but with peculiar 
velocities added. 
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Fig. 4. — Fraction of the total area of the simulation contained in spherical voids of volume V or 
greater. In the left column, the density threshold is / = 0, corresponding to totally empty voids. In 
the right column, the density threshold is / = 0.2. In each panel, the solid line is the void fraction 
measured in real space, and the dotted line is the void fraction measured in redshift space. 
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Fig. 5. — (a) A slice through the underdense voids found in the L > L* sample, using the AM 
algorithm; the density threshold chosen was / = 0.2. The orientation of the slice is the same as 
those in Figures la and 3a. (b) A slice through the underdense AM voids in the same sample, but 
with peculiar velocities added. 
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Fig. 6. — Fraction of the total area of the simulation contained in voids of volume V or greater, 
as defined by the AM algorithm. In the left column, the density threshold is / = 0. In the right 
column, the density threshold is / = 0.2. In each panel, the solid line is the void fraction measured 
in real space, and the dotted line is the void fraction measured in redshift space. 
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Fig. 7. — The mean (filled circles) and standard deviation (error bars) for the elongation statistic 
Q; each data point represents a bin containing 400 voids. The left column gives the results for 
voids measured in real space; the right column gives results for voids measured in redshift space. 
In each case, the density threshold is / = 0.2. 



